Trainable Speech Synthesis Based on Trajectory Modeling of Line Spectrum Pair Frequencies

نویسنده

  • Bryan L. Pellom
چکیده

In this paper we present a novel speaker-dependent speech synthesis algorithm based on modeling temporal trajecto-ries of the speech Line Spectrum Pair Frequencies (LSFs). The overall approach is integrated into a pitch-synchronous analysis/synthesis framework and is shown to allow the synthesis of speaker-dependent voice characteristics through an automatic parameter learning algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spectral normalization employing hidden Markov modeling of line spectrum pair frequencies

This paper proposes a spectral normalization approach in which the acoustical qualities of an input speech waveform are mapped onto that of a desired neutral voice. Such a method can be e ective in reducing the impact of speaker variability such as accent, stress, and emotion for speech recognition. In the proposed method, the transformation is performed by modeling the temporal characteristics...

متن کامل

Model-based speech separation with single-microphone input

Prior knowledge of familiar auditory patterns is essential for separating sound sources in human auditory processing. Speech recognition modeling is one probabilistic way for capturing these familiar auditory patterns. In this paper we focus on separating speech sources with a single-microphone input only. A model-based algorithm is proposed to generate target speech by estimating its spectral ...

متن کامل

Integration of Intonation in Trainable Speech Synthesis

Current developments in artificial speech synthesis place more emphasis on spectral continuities and diverse prosodic effects. The trainable HMM-based speech synthesis method has generated more continuous spectral structure than unit selection method in recent study, but the pitch contour generated by HMM-based method trends to be over-smoothed and lacks syllable variance in Chinese. In this pa...

متن کامل

Integration of Intonation in F0 Trajectory prediction using MSD-HMMs

Present study in speech synthesis places more and more emphasis on the spectral continuities and diverse prosodic effects. The trainable HMM-based speech synthesis method tends to generate more continuous spectral structures than the traditional unit selection method. However, the F0 trajectory generated by HMM-based speech synthesis is often excessively smoothed and lacks prosodic variance. Th...

متن کامل

Speech coding and synthesis using parametric curves

Accurate modeling of co-articulation, the contextsensitive merging of the boundaries between allophones in continuous speech, is vital for natural sounding speech synthesis. This paper describes initial research investigating the use of Bézier Curves to form models of co-articulation in human speech. A 12th order, pitch synchronous line spectral pair (LSP) [1] analysis is performed on a corpus ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998